Overview
Brought to you by YData
Dataset statistics
| Number of variables | 15 |
|---|---|
| Number of observations | 2747 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 300.6 KiB |
| Average record size in memory | 112.0 B |
Variable types
| Numeric | 8 |
|---|---|
| Categorical | 6 |
| DateTime | 1 |
address_stability_missing is highly overall correlated with cust_income_log and 1 other fields | High correlation |
age is highly overall correlated with job_stability_missing | High correlation |
cust_income is highly overall correlated with cust_income_log | High correlation |
cust_income_log is highly overall correlated with address_stability_missing and 1 other fields | High correlation |
employment is highly overall correlated with address_stability_missing and 1 other fields | High correlation |
job_stability_missing is highly overall correlated with age and 2 other fields | High correlation |
job_stability_years is highly overall correlated with job_stability_missing | High correlation |
address_stability_missing is highly imbalanced (84.6%) | Imbalance |
cocunut is uniformly distributed | Uniform |
cocunut has unique values | Unique |
years_with_bank has 276 (10.0%) zeros | Zeros |
cust_income has 76 (2.8%) zeros | Zeros |
cust_income_log has 76 (2.8%) zeros | Zeros |
Reproduction
| Analysis started | 2025-05-17 15:11:25.866646 |
|---|---|
| Analysis finished | 2025-05-17 15:11:46.451654 |
| Duration | 20.59 seconds |
| Software version | ydata-profiling vv4.16.1 |
| Download configuration | config.json |
Variables
cocunut
Real number (ℝ)
Uniform  Unique 
| Distinct | 2747 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 81374 |
| Minimum | 80001 |
|---|---|
| Maximum | 82747 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 21.6 KiB |
Quantile statistics
| Minimum | 80001 |
|---|---|
| 5-th percentile | 80138.3 |
| Q1 | 80687.5 |
| median | 81374 |
| Q3 | 82060.5 |
| 95-th percentile | 82609.7 |
| Maximum | 82747 |
| Range | 2746 |
| Interquartile range (IQR) | 1373 |
Descriptive statistics
| Standard deviation | 793.13492 |
|---|---|
| Coefficient of variation (CV) | 0.0097467854 |
| Kurtosis | -1.2 |
| Mean | 81374 |
| Median Absolute Deviation (MAD) | 687 |
| Skewness | 0 |
| Sum | 2.2353438 × 108 |
| Variance | 629063 |
| Monotonicity | Strictly increasing |
| Value | Count | Frequency (%) |
| 80001 | 1 | < 0.1% |
| 81835 | 1 | < 0.1% |
| 81827 | 1 | < 0.1% |
| 81828 | 1 | < 0.1% |
| 81829 | 1 | < 0.1% |
| 81830 | 1 | < 0.1% |
| 81831 | 1 | < 0.1% |
| 81832 | 1 | < 0.1% |
| 81833 | 1 | < 0.1% |
| 81834 | 1 | < 0.1% |
| Other values (2737) | 2737 |
| Value | Count | Frequency (%) |
| 80001 | 1 | |
| 80002 | 1 | |
| 80003 | 1 | |
| 80004 | 1 | |
| 80005 | 1 | |
| 80006 | 1 | |
| 80007 | 1 | |
| 80008 | 1 | |
| 80009 | 1 | |
| 80010 | 1 |
| Value | Count | Frequency (%) |
| 82747 | 1 | |
| 82746 | 1 | |
| 82745 | 1 | |
| 82744 | 1 | |
| 82743 | 1 | |
| 82742 | 1 | |
| 82741 | 1 | |
| 82740 | 1 | |
| 82739 | 1 | |
| 82738 | 1 |
age
Real number (ℝ)
High correlation 
| Distinct | 54 |
|---|---|
| Distinct (%) | 2.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 48.736804 |
| Minimum | 21 |
|---|---|
| Maximum | 74 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 21.6 KiB |
Quantile statistics
| Minimum | 21 |
|---|---|
| 5-th percentile | 27.3 |
| Q1 | 38 |
| median | 49 |
| Q3 | 60 |
| 95-th percentile | 69 |
| Maximum | 74 |
| Range | 53 |
| Interquartile range (IQR) | 22 |
Descriptive statistics
| Standard deviation | 13.186774 |
|---|---|
| Coefficient of variation (CV) | 0.27057117 |
| Kurtosis | -1.0782457 |
| Mean | 48.736804 |
| Median Absolute Deviation (MAD) | 11 |
| Skewness | -0.079222786 |
| Sum | 133880 |
| Variance | 173.89101 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 41 | 77 | 2.8% |
| 65 | 75 | 2.7% |
| 45 | 74 | 2.7% |
| 67 | 72 | 2.6% |
| 62 | 71 | 2.6% |
| 48 | 71 | 2.6% |
| 63 | 70 | 2.5% |
| 44 | 69 | 2.5% |
| 39 | 67 | 2.4% |
| 66 | 66 | 2.4% |
| Other values (44) | 2035 |
| Value | Count | Frequency (%) |
| 21 | 4 | 0.1% |
| 22 | 15 | 0.5% |
| 23 | 14 | 0.5% |
| 24 | 25 | |
| 25 | 24 | |
| 26 | 30 | |
| 27 | 26 | |
| 28 | 31 | |
| 29 | 59 | |
| 30 | 44 |
| Value | Count | Frequency (%) |
| 74 | 6 | 0.2% |
| 73 | 8 | 0.3% |
| 72 | 12 | 0.4% |
| 71 | 30 | 1.1% |
| 70 | 32 | |
| 69 | 58 | |
| 68 | 51 | |
| 67 | 72 | |
| 66 | 66 | |
| 65 | 75 |
years_with_bank
Real number (ℝ)
Zeros 
| Distinct | 16 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6.9912632 |
| Minimum | 0 |
|---|---|
| Maximum | 15 |
| Zeros | 276 |
| Zeros (%) | 10.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 21.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 3 |
| median | 8 |
| Q3 | 11 |
| 95-th percentile | 13 |
| Maximum | 15 |
| Range | 15 |
| Interquartile range (IQR) | 8 |
Descriptive statistics
| Standard deviation | 4.5228154 |
|---|---|
| Coefficient of variation (CV) | 0.64692393 |
| Kurtosis | -1.4164684 |
| Mean | 6.9912632 |
| Median Absolute Deviation (MAD) | 4 |
| Skewness | -0.15373129 |
| Sum | 19205 |
| Variance | 20.45586 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 11 | 413 | |
| 0 | 276 | |
| 12 | 275 | |
| 10 | 234 | |
| 4 | 230 | |
| 2 | 197 | |
| 3 | 176 | 6.4% |
| 9 | 172 | 6.3% |
| 1 | 171 | 6.2% |
| 13 | 128 | 4.7% |
| Other values (6) | 475 |
| Value | Count | Frequency (%) |
| 0 | 276 | |
| 1 | 171 | |
| 2 | 197 | |
| 3 | 176 | |
| 4 | 230 | |
| 5 | 119 | |
| 6 | 86 | 3.1% |
| 7 | 76 | 2.8% |
| 8 | 99 | 3.6% |
| 9 | 172 |
| Value | Count | Frequency (%) |
| 15 | 32 | 1.2% |
| 14 | 63 | 2.3% |
| 13 | 128 | 4.7% |
| 12 | 275 | |
| 11 | 413 | |
| 10 | 234 | |
| 9 | 172 | |
| 8 | 99 | 3.6% |
| 7 | 76 | 2.8% |
| 6 | 86 | 3.1% |
marital_status
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 21.6 KiB |
| M | |
|---|---|
| S | |
| W | 177 |
| D | 162 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | M |
|---|---|
| 2nd row | W |
| 3rd row | M |
| 4th row | D |
| 5th row | M |
Common Values
| Value | Count | Frequency (%) |
| M | 1930 | |
| S | 478 | 17.4% |
| W | 177 | 6.4% |
| D | 162 | 5.9% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| m | 1930 | |
| s | 478 | 17.4% |
| w | 177 | 6.4% |
| d | 162 | 5.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| M | 1930 | |
| S | 478 | 17.4% |
| W | 177 | 6.4% |
| D | 162 | 5.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 2747 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| M | 1930 | |
| S | 478 | 17.4% |
| W | 177 | 6.4% |
| D | 162 | 5.9% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 2747 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| M | 1930 | |
| S | 478 | 17.4% |
| W | 177 | 6.4% |
| D | 162 | 5.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 2747 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| M | 1930 | |
| S | 478 | 17.4% |
| W | 177 | 6.4% |
| D | 162 | 5.9% |
education
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 21.6 KiB |
| HGH | |
|---|---|
| BCR | |
| OTH | 110 |
| MAS | 24 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | HGH |
|---|---|
| 2nd row | OTH |
| 3rd row | BCR |
| 4th row | BCR |
| 5th row | HGH |
Common Values
| Value | Count | Frequency (%) |
| HGH | 1872 | |
| BCR | 741 | 27.0% |
| OTH | 110 | 4.0% |
| MAS | 24 | 0.9% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| hgh | 1872 | |
| bcr | 741 | 27.0% |
| oth | 110 | 4.0% |
| mas | 24 | 0.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| H | 3854 | |
| G | 1872 | |
| B | 741 | 9.0% |
| C | 741 | 9.0% |
| R | 741 | 9.0% |
| O | 110 | 1.3% |
| T | 110 | 1.3% |
| M | 24 | 0.3% |
| A | 24 | 0.3% |
| S | 24 | 0.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 8241 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| H | 3854 | |
| G | 1872 | |
| B | 741 | 9.0% |
| C | 741 | 9.0% |
| R | 741 | 9.0% |
| O | 110 | 1.3% |
| T | 110 | 1.3% |
| M | 24 | 0.3% |
| A | 24 | 0.3% |
| S | 24 | 0.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 8241 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| H | 3854 | |
| G | 1872 | |
| B | 741 | 9.0% |
| C | 741 | 9.0% |
| R | 741 | 9.0% |
| O | 110 | 1.3% |
| T | 110 | 1.3% |
| M | 24 | 0.3% |
| A | 24 | 0.3% |
| S | 24 | 0.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 8241 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| H | 3854 | |
| G | 1872 | |
| B | 741 | 9.0% |
| C | 741 | 9.0% |
| R | 741 | 9.0% |
| O | 110 | 1.3% |
| T | 110 | 1.3% |
| M | 24 | 0.3% |
| A | 24 | 0.3% |
| S | 24 | 0.3% |
employment
Categorical
High correlation 
| Distinct | 4 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 21.6 KiB |
| PVE | |
|---|---|
| STE | |
| RET | |
| MISC |
Length
| Max length | 4 |
|---|---|
| Median length | 3 |
| Mean length | 3.043684 |
| Min length | 3 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | PVE |
|---|---|
| 2nd row | RET |
| 3rd row | STE |
| 4th row | MISC |
| 5th row | PVE |
Common Values
| Value | Count | Frequency (%) |
| PVE | 1186 | |
| STE | 742 | |
| RET | 699 | |
| MISC | 120 | 4.4% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| pve | 1186 | |
| ste | 742 | |
| ret | 699 | |
| misc | 120 | 4.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| E | 2627 | |
| T | 1441 | |
| P | 1186 | |
| V | 1186 | |
| S | 862 | 10.3% |
| R | 699 | 8.4% |
| M | 120 | 1.4% |
| I | 120 | 1.4% |
| C | 120 | 1.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 8361 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| E | 2627 | |
| T | 1441 | |
| P | 1186 | |
| V | 1186 | |
| S | 862 | 10.3% |
| R | 699 | 8.4% |
| M | 120 | 1.4% |
| I | 120 | 1.4% |
| C | 120 | 1.4% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 8361 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| E | 2627 | |
| T | 1441 | |
| P | 1186 | |
| V | 1186 | |
| S | 862 | 10.3% |
| R | 699 | 8.4% |
| M | 120 | 1.4% |
| I | 120 | 1.4% |
| C | 120 | 1.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 8361 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| E | 2627 | |
| T | 1441 | |
| P | 1186 | |
| V | 1186 | |
| S | 862 | 10.3% |
| R | 699 | 8.4% |
| M | 120 | 1.4% |
| I | 120 | 1.4% |
| C | 120 | 1.4% |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | M |
|---|---|
| 2nd row | F |
| 3rd row | F |
| 4th row | F |
| 5th row | M |
Common Values
| Value | Count | Frequency (%) |
| M | 1379 | |
| F | 1368 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| m | 1379 | |
| f | 1368 |
Most occurring characters
| Value | Count | Frequency (%) |
| M | 1379 | |
| F | 1368 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 2747 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| M | 1379 | |
| F | 1368 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 2747 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| M | 1379 | |
| F | 1368 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 2747 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| M | 1379 | |
| F | 1368 |
cust_income
Real number (ℝ)
High correlation  Zeros 
| Distinct | 2370 |
|---|---|
| Distinct (%) | 86.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 368.59353 |
| Minimum | 0 |
|---|---|
| Maximum | 7978.9615 |
| Zeros | 76 |
| Zeros (%) | 2.8% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 21.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 162.25399 |
| Q1 | 214.62477 |
| median | 292.30769 |
| Q3 | 410.40346 |
| 95-th percentile | 819.53154 |
| Maximum | 7978.9615 |
| Range | 7978.9615 |
| Interquartile range (IQR) | 195.77869 |
Descriptive statistics
| Standard deviation | 333.43406 |
|---|---|
| Coefficient of variation (CV) | 0.90461181 |
| Kurtosis | 159.39583 |
| Mean | 368.59353 |
| Median Absolute Deviation (MAD) | 92.307692 |
| Skewness | 8.9832876 |
| Sum | 1012526.4 |
| Variance | 111178.27 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 76 | 2.8% |
| 307.6923077 | 25 | 0.9% |
| 230.7692308 | 24 | 0.9% |
| 269.2307692 | 24 | 0.9% |
| 384.6153846 | 22 | 0.8% |
| 192.3076923 | 21 | 0.8% |
| 215.3846154 | 12 | 0.4% |
| 276.9230769 | 11 | 0.4% |
| 184.6153846 | 11 | 0.4% |
| 346.1538462 | 10 | 0.4% |
| Other values (2360) | 2511 |
| Value | Count | Frequency (%) |
| 0 | 76 | |
| 1.153846154 | 1 | < 0.1% |
| 117.3846154 | 1 | < 0.1% |
| 123.1 | 1 | < 0.1% |
| 129.8023077 | 1 | < 0.1% |
| 132.2076923 | 1 | < 0.1% |
| 137.4062308 | 1 | < 0.1% |
| 138.6440769 | 1 | < 0.1% |
| 139.9797692 | 1 | < 0.1% |
| 140.6102308 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 7978.961538 | 1 | |
| 6864.915385 | 1 | |
| 3053.984615 | 1 | |
| 3007.018846 | 1 | |
| 2711.907692 | 1 | |
| 2700.203769 | 1 | |
| 2589.938462 | 1 | |
| 2587.807692 | 1 | |
| 2523.1 | 1 | |
| 2346.221462 | 1 |
| Distinct | 1752 |
|---|---|
| Distinct (%) | 63.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 21.6 KiB |
| Minimum | 2001-09-11 00:00:00 |
|---|---|
| Maximum | 2016-10-19 00:00:00 |
| Invalid dates | 0 |
| Invalid dates (%) | 0.0% |
current_balance_eur
Real number (ℝ)
| Distinct | 1137 |
|---|---|
| Distinct (%) | 41.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3122.6323 |
| Minimum | 0 |
|---|---|
| Maximum | 18176.994 |
| Zeros | 18 |
| Zeros (%) | 0.7% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 21.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 269.25315 |
| Q1 | 1274.9794 |
| median | 2307.6923 |
| Q3 | 4615.3846 |
| 95-th percentile | 7692.3077 |
| Maximum | 18176.994 |
| Range | 18176.994 |
| Interquartile range (IQR) | 3340.4052 |
Descriptive statistics
| Standard deviation | 2447.3621 |
|---|---|
| Coefficient of variation (CV) | 0.78374968 |
| Kurtosis | 3.1474657 |
| Mean | 3122.6323 |
| Median Absolute Deviation (MAD) | 1517.3077 |
| Skewness | 1.3827727 |
| Sum | 8577871 |
| Variance | 5989581.1 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2307.692308 | 499 | 18.2% |
| 4615.384615 | 207 | 7.5% |
| 7692.307692 | 92 | 3.3% |
| 2371.153846 | 37 | 1.3% |
| 790.3846154 | 29 | 1.1% |
| 5366.711538 | 21 | 0.8% |
| 1580.769231 | 20 | 0.7% |
| 0 | 18 | 0.7% |
| 3951.923077 | 18 | 0.7% |
| 1185.576923 | 17 | 0.6% |
| Other values (1127) | 1789 |
| Value | Count | Frequency (%) |
| 0 | 18 | |
| 2.089923077 | 1 | < 0.1% |
| 18.14769231 | 1 | < 0.1% |
| 75.61676923 | 1 | < 0.1% |
| 78.43846154 | 1 | < 0.1% |
| 84.85615385 | 1 | < 0.1% |
| 92.16 | 1 | < 0.1% |
| 118.8461538 | 1 | < 0.1% |
| 118.8540769 | 1 | < 0.1% |
| 120.6153846 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 18176.99415 | 1 | < 0.1% |
| 18121.03669 | 1 | < 0.1% |
| 17966.65285 | 1 | < 0.1% |
| 16652.01969 | 1 | < 0.1% |
| 15775.59762 | 1 | < 0.1% |
| 15384.61538 | 1 | < 0.1% |
| 14923.75069 | 1 | < 0.1% |
| 14226.92308 | 1 | < 0.1% |
| 13846.15385 | 3 | |
| 13672.18462 | 1 | < 0.1% |
job_stability_years
Real number (ℝ)
High correlation 
| Distinct | 1284 |
|---|---|
| Distinct (%) | 46.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 10.078648 |
| Minimum | -0.019178082 |
|---|---|
| Maximum | 42.769863 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 1 |
| Negative (%) | < 0.1% |
| Memory size | 21.6 KiB |
Quantile statistics
| Minimum | -0.019178082 |
|---|---|
| 5-th percentile | 1.0136986 |
| Q1 | 4.6794521 |
| median | 8.7315068 |
| Q3 | 11.912329 |
| 95-th percentile | 28.510685 |
| Maximum | 42.769863 |
| Range | 42.789041 |
| Interquartile range (IQR) | 7.2328767 |
Descriptive statistics
| Standard deviation | 7.9101843 |
|---|---|
| Coefficient of variation (CV) | 0.78484576 |
| Kurtosis | 2.304545 |
| Mean | 10.078648 |
| Median Absolute Deviation (MAD) | 3.7643836 |
| Skewness | 1.5022342 |
| Sum | 27686.047 |
| Variance | 62.571016 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 8.731506849 | 711 | 25.9% |
| 6.764383562 | 18 | 0.7% |
| 0.6767123288 | 15 | 0.5% |
| 0.2630136986 | 13 | 0.5% |
| 0.5123287671 | 12 | 0.4% |
| 2.676712329 | 12 | 0.4% |
| 0.597260274 | 10 | 0.4% |
| 10.10136986 | 10 | 0.4% |
| 5.350684932 | 10 | 0.4% |
| 1.01369863 | 10 | 0.4% |
| Other values (1274) | 1926 |
| Value | Count | Frequency (%) |
| -0.01917808219 | 1 | < 0.1% |
| 0.002739726027 | 1 | < 0.1% |
| 0.008219178082 | 1 | < 0.1% |
| 0.01095890411 | 1 | < 0.1% |
| 0.03835616438 | 1 | < 0.1% |
| 0.07397260274 | 1 | < 0.1% |
| 0.08767123288 | 1 | < 0.1% |
| 0.0904109589 | 1 | < 0.1% |
| 0.09315068493 | 4 | 0.1% |
| 0.2630136986 | 13 |
| Value | Count | Frequency (%) |
| 42.76986301 | 1 | |
| 41.75342466 | 1 | |
| 41.25479452 | 1 | |
| 41.04109589 | 1 | |
| 40.79178082 | 1 | |
| 40.78082192 | 1 | |
| 39.98630137 | 1 | |
| 39.97808219 | 1 | |
| 39.78630137 | 1 | |
| 39.29041096 | 1 |
address_stability_years
Real number (ℝ)
| Distinct | 1600 |
|---|---|
| Distinct (%) | 58.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 24.520802 |
| Minimum | 0.51232877 |
|---|---|
| Maximum | 60 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 21.6 KiB |
Quantile statistics
| Minimum | 0.51232877 |
|---|---|
| 5-th percentile | 3.3619178 |
| Q1 | 10.852055 |
| median | 23.224658 |
| Q3 | 35.49589 |
| 95-th percentile | 55.066301 |
| Maximum | 60 |
| Range | 59.487671 |
| Interquartile range (IQR) | 24.643836 |
Descriptive statistics
| Standard deviation | 15.633069 |
|---|---|
| Coefficient of variation (CV) | 0.63754313 |
| Kurtosis | -0.62072625 |
| Mean | 24.520802 |
| Median Absolute Deviation (MAD) | 12.372603 |
| Skewness | 0.4964408 |
| Sum | 67358.644 |
| Variance | 244.39285 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 60 | 77 | 2.8% |
| 23.22465753 | 61 | 2.2% |
| 16.77260274 | 41 | 1.5% |
| 6.764383562 | 33 | 1.2% |
| 36.78630137 | 31 | 1.1% |
| 11.76712329 | 22 | 0.8% |
| 8.767123288 | 22 | 0.8% |
| 4.764383562 | 21 | 0.8% |
| 9.767123288 | 21 | 0.8% |
| 10.76712329 | 21 | 0.8% |
| Other values (1590) | 2397 |
| Value | Count | Frequency (%) |
| 0.5123287671 | 1 | < 0.1% |
| 0.597260274 | 3 | 0.1% |
| 0.6246575342 | 1 | < 0.1% |
| 0.6767123288 | 4 | 0.1% |
| 0.7150684932 | 1 | < 0.1% |
| 0.7616438356 | 10 | |
| 0.9287671233 | 1 | < 0.1% |
| 1.01369863 | 1 | < 0.1% |
| 1.071232877 | 1 | < 0.1% |
| 1.095890411 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 60 | 77 | |
| 59.78356164 | 1 | < 0.1% |
| 59.68767123 | 1 | < 0.1% |
| 59.55342466 | 1 | < 0.1% |
| 59.52876712 | 1 | < 0.1% |
| 59.52328767 | 1 | < 0.1% |
| 59.26027397 | 1 | < 0.1% |
| 59.24657534 | 1 | < 0.1% |
| 58.82465753 | 1 | < 0.1% |
| 58.8 | 2 | 0.1% |
job_stability_missing
Categorical
High correlation 
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 21.6 KiB |
| 0 | |
|---|---|
| 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 1 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 2036 | |
| 1 | 711 | 25.9% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 2036 | |
| 1 | 711 | 25.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 2036 | |
| 1 | 711 | 25.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 2747 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 2036 | |
| 1 | 711 | 25.9% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 2747 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 2036 | |
| 1 | 711 | 25.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 2747 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 2036 | |
| 1 | 711 | 25.9% |
address_stability_missing
Categorical
High correlation  Imbalance 
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 21.6 KiB |
| 0 | |
|---|---|
| 1 | 61 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 2686 | |
| 1 | 61 | 2.2% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 2686 | |
| 1 | 61 | 2.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 2686 | |
| 1 | 61 | 2.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 2747 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 2686 | |
| 1 | 61 | 2.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 2747 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 2686 | |
| 1 | 61 | 2.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 2747 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 2686 | |
| 1 | 61 | 2.2% |
cust_income_log
Real number (ℝ)
High correlation  Zeros 
| Distinct | 2370 |
|---|---|
| Distinct (%) | 86.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5.6165543 |
| Minimum | 0 |
|---|---|
| Maximum | 8.9846889 |
| Zeros | 76 |
| Zeros (%) | 2.8% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 21.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 5.0953072 |
| Q1 | 5.3735397 |
| median | 5.6812222 |
| Q3 | 6.0195743 |
| 95-th percentile | 6.7099522 |
| Maximum | 8.9846889 |
| Range | 8.9846889 |
| Interquartile range (IQR) | 0.64603457 |
Descriptive statistics
| Standard deviation | 1.076741 |
|---|---|
| Coefficient of variation (CV) | 0.19170847 |
| Kurtosis | 18.028581 |
| Mean | 5.6165543 |
| Median Absolute Deviation (MAD) | 0.32011785 |
| Skewness | -3.7462221 |
| Sum | 15428.675 |
| Variance | 1.1593712 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 76 | 2.8% |
| 5.732345013 | 25 | 0.9% |
| 5.445742182 | 24 | 0.9% |
| 5.599276295 | 24 | 0.9% |
| 5.95484046 | 22 | 0.8% |
| 5.26428318 | 21 | 0.8% |
| 5.377057451 | 12 | 0.4% |
| 5.627344374 | 11 | 0.4% |
| 5.223676708 | 11 | 0.4% |
| 5.849768042 | 10 | 0.4% |
| Other values (2360) | 2511 |
| Value | Count | Frequency (%) |
| 0 | 76 | |
| 0.7672551528 | 1 | < 0.1% |
| 4.773938777 | 1 | < 0.1% |
| 4.821087692 | 1 | < 0.1% |
| 4.873687082 | 1 | < 0.1% |
| 4.891909506 | 1 | < 0.1% |
| 4.930193062 | 1 | < 0.1% |
| 4.939096878 | 1 | < 0.1% |
| 4.948616399 | 1 | < 0.1% |
| 4.95307843 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 8.984688871 | 1 | |
| 8.83432465 | 1 | |
| 8.024529836 | 1 | |
| 8.00903695 | 1 | |
| 7.905776288 | 1 | |
| 7.901452793 | 1 | |
| 7.85977543 | 1 | |
| 7.858952698 | 1 | |
| 7.833639843 | 1 | |
| 7.760987551 | 1 |
Interactions
Correlations
| address_stability_missing | address_stability_years | age | cocunut | current_balance_eur | cust_income | cust_income_log | education | employment | gender | job_stability_missing | job_stability_years | marital_status | years_with_bank | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| address_stability_missing | 1.000 | 0.392 | 0.078 | 0.137 | 0.000 | 0.000 | 0.677 | 0.026 | 0.535 | 0.000 | 0.184 | 0.120 | 0.050 | 0.071 |
| address_stability_years | 0.392 | 1.000 | 0.260 | 0.017 | -0.036 | -0.109 | -0.109 | 0.055 | 0.214 | 0.184 | 0.215 | 0.147 | 0.091 | 0.008 |
| age | 0.078 | 0.260 | 1.000 | -0.011 | 0.007 | -0.086 | -0.086 | 0.063 | 0.498 | 0.121 | 0.715 | 0.406 | 0.334 | 0.257 |
| cocunut | 0.137 | 0.017 | -0.011 | 1.000 | 0.044 | -0.026 | -0.026 | 0.015 | 0.040 | 0.000 | 0.000 | -0.001 | 0.024 | 0.008 |
| current_balance_eur | 0.000 | -0.036 | 0.007 | 0.044 | 1.000 | 0.139 | 0.139 | 0.123 | 0.097 | 0.069 | 0.139 | -0.005 | 0.023 | 0.157 |
| cust_income | 0.000 | -0.109 | -0.086 | -0.026 | 0.139 | 1.000 | 1.000 | 0.117 | 0.022 | 0.050 | 0.000 | 0.183 | 0.050 | 0.277 |
| cust_income_log | 0.677 | -0.109 | -0.086 | -0.026 | 0.139 | 1.000 | 1.000 | 0.178 | 0.283 | 0.084 | 0.205 | 0.183 | 0.020 | 0.277 |
| education | 0.026 | 0.055 | 0.063 | 0.015 | 0.123 | 0.117 | 0.178 | 1.000 | 0.112 | 0.123 | 0.096 | 0.061 | 0.019 | 0.126 |
| employment | 0.535 | 0.214 | 0.498 | 0.040 | 0.097 | 0.022 | 0.283 | 0.112 | 1.000 | 0.115 | 0.851 | 0.453 | 0.229 | 0.121 |
| gender | 0.000 | 0.184 | 0.121 | 0.000 | 0.069 | 0.050 | 0.084 | 0.123 | 0.115 | 1.000 | 0.045 | 0.088 | 0.230 | 0.062 |
| job_stability_missing | 0.184 | 0.215 | 0.715 | 0.000 | 0.139 | 0.000 | 0.205 | 0.096 | 0.851 | 0.045 | 1.000 | 0.729 | 0.296 | 0.081 |
| job_stability_years | 0.120 | 0.147 | 0.406 | -0.001 | -0.005 | 0.183 | 0.183 | 0.061 | 0.453 | 0.088 | 0.729 | 1.000 | 0.197 | 0.315 |
| marital_status | 0.050 | 0.091 | 0.334 | 0.024 | 0.023 | 0.050 | 0.020 | 0.019 | 0.229 | 0.230 | 0.296 | 0.197 | 1.000 | 0.088 |
| years_with_bank | 0.071 | 0.008 | 0.257 | 0.008 | 0.157 | 0.277 | 0.277 | 0.126 | 0.121 | 0.062 | 0.081 | 0.315 | 0.088 | 1.000 |
Missing values
Sample
| cocunut | age | years_with_bank | marital_status | education | employment | gender | cust_income | current_with_bank_date | current_balance_eur | job_stability_years | address_stability_years | job_stability_missing | address_stability_missing | cust_income_log | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 80001 | 32 | 3 | M | HGH | PVE | M | 423.076923 | 2014-07-02 | 143.000000 | 11.767123 | 31.726027 | 0 | 0 | 6.049915 |
| 1 | 80002 | 51 | 10 | W | OTH | RET | F | 140.610231 | 2007-02-21 | 2288.700154 | 8.731507 | 5.956164 | 1 | 0 | 4.953078 |
| 2 | 80003 | 36 | 7 | M | BCR | STE | F | 326.923077 | 2009-10-26 | 2268.491692 | 7.073973 | 34.994521 | 0 | 0 | 5.792779 |
| 3 | 80004 | 46 | 11 | D | BCR | MISC | F | 738.820000 | 2005-11-30 | 4536.983462 | 10.934247 | 8.098630 | 0 | 0 | 6.606407 |
| 4 | 80005 | 39 | 10 | M | HGH | PVE | M | 483.928231 | 2006-12-05 | 3076.923077 | 3.347945 | 23.109589 | 0 | 0 | 6.184001 |
| 5 | 80006 | 64 | 9 | M | BCR | RET | F | 274.515385 | 2008-06-16 | 153.769231 | 23.104110 | 33.369863 | 0 | 0 | 5.618643 |
| 6 | 80007 | 64 | 8 | W | HGH | RET | F | 237.634000 | 2009-06-10 | 2307.692308 | 8.731507 | 38.038356 | 1 | 0 | 5.474931 |
| 7 | 80008 | 74 | 11 | M | HGH | RET | M | 221.043077 | 2006-05-26 | 790.384615 | 8.731507 | 19.972603 | 1 | 0 | 5.402871 |
| 8 | 80009 | 26 | 4 | S | HGH | PVE | M | 248.646154 | 2012-11-29 | 4615.384615 | 4.701370 | 7.013699 | 0 | 0 | 5.520045 |
| 9 | 80010 | 42 | 13 | M | BCR | PVE | F | 228.076923 | 2004-06-29 | 4820.321462 | 3.887671 | 41.131507 | 0 | 0 | 5.434058 |
| cocunut | age | years_with_bank | marital_status | education | employment | gender | cust_income | current_with_bank_date | current_balance_eur | job_stability_years | address_stability_years | job_stability_missing | address_stability_missing | cust_income_log | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 2737 | 82738 | 30 | 4 | M | HGH | PVE | F | 215.384615 | 2013-06-25 | 2307.692308 | 4.600000 | 2.432877 | 0 | 0 | 5.377057 |
| 2738 | 82739 | 67 | 8 | M | HGH | RET | F | 253.234846 | 2009-01-27 | 4742.307692 | 8.731507 | 35.536986 | 1 | 0 | 5.538258 |
| 2739 | 82740 | 51 | 12 | M | BCR | STE | F | 553.846154 | 2005-06-09 | 300.680769 | 19.358904 | 49.912329 | 0 | 0 | 6.318691 |
| 2740 | 82741 | 65 | 8 | M | HGH | RET | M | 202.115077 | 2009-05-06 | 790.384615 | 8.731507 | 40.616438 | 1 | 0 | 5.313773 |
| 2741 | 82742 | 36 | 14 | S | HGH | PVE | F | 395.209000 | 2003-03-26 | 2307.692308 | 9.339726 | 11.271233 | 0 | 0 | 5.981942 |
| 2742 | 82743 | 53 | 5 | M | OTH | PVE | F | 183.461538 | 2012-05-04 | 5407.524308 | 10.265753 | 36.320548 | 0 | 0 | 5.217441 |
| 2743 | 82744 | 56 | 8 | M | HGH | STE | F | 276.923077 | 2009-02-27 | 253.459231 | 9.479452 | 9.479452 | 0 | 0 | 5.627344 |
| 2744 | 82745 | 67 | 5 | D | BCR | RET | F | 148.721154 | 2012-05-28 | 2371.153846 | 19.153425 | 4.095890 | 0 | 0 | 5.008775 |
| 2745 | 82746 | 48 | 1 | M | HGH | STE | F | 258.561538 | 2015-12-15 | 2307.692308 | 17.775342 | 28.780822 | 0 | 0 | 5.558994 |
| 2746 | 82747 | 44 | 14 | M | HGH | PVE | M | 682.954154 | 2002-12-20 | 7692.307692 | 10.767123 | 42.898630 | 0 | 0 | 6.527891 |